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(57) Cette invention conceme un precede permettant de 
cacher les donnees d'un message dans un support de 
donnees, ainsi qu'un precede permettant d'extraire les 
donnees de message qui ont ete cachees dans le support 
de donnees. Dans le premier precede, les donnees du 
message sont dispersees et cachees dans le support de 
donnees de maniere a ce qu'aucune tierce personne ne 
puisse les modifier. D'une maniere plus precise, les 
elements de reseau d'un reseau de message sont 
disperses et caches dans un reseau du support en fonction 
d'une valeur de statut S. Cette valeur designe un element 
de reseau specifique dans le reseau du support lorsque les 



(57) A data hiding method for hiding message data in 
media data and a data extracting method for extracting 
message data hidden in the media data. In the data hiding 
method, the message data are dispersedly hidden in the 
media data so that no third person can alter the message 
data. Specifically, the array elements of a message array 
are dispersedly hidden in a media array based on a status 
value S designating a specific array element in the media 
array when the media data and message data are 
expressed as the media array and the message array, 
respectively. The hiding method includes (a) a step of 
determining the j-th (j>0) status value Sj, (b) a step of 



OPIC 

Office de la propriete 
u Canada 




illf cipo 

~ Canadian Intellectual 

Property Office 



(2d (ad 2,230,071 

(86) 1997/02/13 

(87) 1997/12/24 



donnees du support et les donnees du message sont 
exprimees respectivement sous forme de reseau de 
support et de reseau de message. Ce premier precede 
comprend les etapes suivantes: (a) determiner la j-eme 
valeur de statut Sj (j>0); determiner la (j+l)-eme valeur 
de statut Sj+j en fonction de la valeur Sj, ainsi que 
l'element de reseau du reseau du support designe par la 
valeur Sj, et l'element de reseau susmentionne du reseau 
de message; et enfin (c), cacher les donnees qui doivent 
l'etre dans l'element de reseau du reseau du support 
designe par la valeur Sj+j. 



determining the (j+l)-th status value Sj+j based on the 
value Sj, the array element of the media array designated 
by the value Sj and the above-mentioned array element 
of the message array, and (c) a step of hiding data to be 
hidden in the array element of the media array designated 
by the value Sj+]. 
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DATA HIDING AND EXTRACTION METHODS 
ABSTRACT 

To provide a data hiding method of hiding media data in message data and a data 
extraction method of extracting the hidden data wherein message data is dispersively 
hidden in media data such as an image or sound to prevent a third person from modifying 
the message data easily. A data hiding method in which media data is expressed as a 
5 media array while message data is expressed as a message array so that the array 
elements of the message array can be dispersively hidden in the message array based on 
a state values specifying a particular array element of the media array, comprises the steps 
of: (a) determining the j-th (j 2 0) state value Sf, 

(b) determining (j + 1)-th state value S j+1 based on the j-th state value, the array 
10 element of the media array indicated by the j-th state value, and the array element of the 

message array; and 

(c) hiding data with respect to the array element of the media array indicated by the 
G + 1)-th state value S j+1 . 
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DATA HIDING AND EXTRACTION METHODS 
BACKGROUND OF THE INVENTION 

5 

TECHNICAL FIELD 

The present invention relates to a data hiding method for hiding message data in media 
1 o data and a data extraction method for extracting the hidden data. 

BACKGROUND ART 

Due to the development of multimedia-oriented society, a large amount of digitized image 
15 and sound information is distributed over Internet or as recording media such as CD- 
ROMs, DVD-ROMs (or DVD-RAMs), or DVCs. Since anyone can simply make a complete 
copy of such digitalized information which is unlikely to be degraded, its unfair use is a 
problem. To prevent a third person from unfairly copying media data such as image or 
sound data, attention is being paid to a hiding technology that hides in original media data, 
20 information such as the sign of the author. If digitalized image data is illegally copied, the 
sign hidden in the copy can be checked to identify its source in order to determine whether 
it has been illegally made. Such a technology is called "data hiding". 

BRIEF DESCRIPTION OF THE DRAWINGS 
25 Fig. 1 is a half tone image comprising digitalized data shown on a display; 

Fig. 2 schematically shows a layout of message data distributed over the image 
according to conventional methods; 

Fig. 3 describes a media array and a message array; 

Fig. 4 schematically shows the relationship among media array values used in 
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hiding processing; 

Fig. 5 is a schematic drawing describing an object to be hidden; 

Fig. 6 is a flowchart showing a procedure for hiding message data in media data; 

Fig. 7 shows the relationship between a state value S H and a state value S,- in 

hiding; 

Fig. 8 is a flowchart showing a procedure for extracting message data from hidden 

data; 

Fig. 9 shows the relationship between a state value Sj and a state value S H in 
extraction; 

Fig. 10 describes the determination of whether a generated message array value 
is the beginning of the message; 

Fig. 11 is a conceptual drawing showing a state in which an incorrect state 
sequence S has been generated due to modification; 

Fig. 12 describes data hiding and extraction methods using PBC; 

Fig. 13 describes a hiding method with PCB using a single pixel as a pixel block; 

Fig. 14 describes a method for hiding a message and positional information in an 
original image; 

Fig. 15 shows the case in which concentric circular arcs are used as positional 
information; 

Fig. 16 describes the identification of a reference position B in the case in which 
concentric circular arcs are used as positional information; 

Fig. 17 describes hiding and extraction methods that use concentric circular arcs 
as positional information; 

Fig. 18 is a block diagram of a broadcasting system; 

Fig. 19 is a block diagram showing a transmitter and a receiver in internet; 

Fig. 20 is a block diagram showing a server and a client; 

Fig. 21 is a block diagram of a finger printing and water marking system; 
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Fig. 22 is a block diagram of a data extraction system; and 
Fig. 23 is a block diagram of a semiconductor integrated circuit with the data 
extraction system provided on a single chip. 



5 Fig. 1 is a half tone image comprising digitalized data shown on a display. Photo 
description (messages) such as a "nurse", a "river", a "kindergarten pupil", and a "bird" is 
hidden in the media data, that is, the digitalized image in Fig. 1 (a), as shown in Fig. 1 (b). 
Media data can be obtained by dividing an image from a photograph into small points and 
determining the numerical values of the brightness and color of each point. In this case, 

1 o the original numerical values of the image are intentionally somewhat changed. Very small 
changes in the numerical values do not substantially disturb the image and are not 
perceived by users. This nature can be used to hide completely different information 
(message data) in the original image. Message data hidden in the image may include any 
information and may be, for example, a lattice pattern, a rule-like figure, or the sign of the 

15 producer of the image. Message data hidden in media data can be extracted by 
processing with a special program. Thus, based on the extracted message data, it can be 
determined whether the original media data has been modified. 

Each of the messages shown in Fig. 1 (b) is hidden near a meaningful region on the image. 

20 For example, the message "bird" is hidden near the region in which the bird in the image 
is present, and acts as a caption. To hide a message such as information on the owner 
in the image, the massage is preferably distributed over the image. This is because the 
message, which is locally hidden, may degrades the quality of that region when its amount 
is large. In addition, if part of the image is cut off, it is likely to be able to extract the 

25 massage if it is distributed over the image. Thus, how to determine where each message 
piece is hidden is important. The positions are determined on the bases of a state 
sequence S. That is, each element of the state sequences is associated with each 
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message and the message is hidden at the corresponding position determined on the 
basis of the element. 

According to conventional data hiding methods, the state sequence S has been 
5 determined by a random number sequence. Fig. 2 schematically shows a layout of 

message data distributed on an image according to conventional methods. The image 

shown in Fig. 1 (a) is divided into I image regions, each of which is numbered from 0 to 9. 

Then, message data is expressed as a message array (m), and an array element is 

described as an array value m[n] (0 <; n <; 9). Each array value is associated with divided 
10 message data. Each message is hidden at the position in the image specified on the basis 

of the element in the state sequence S. The following equations determine a position pO 

at which 0th message m[0] is hidden. 

[Equation 1] 
15 State value S 0 = initial value (constant) 
Position p 0 = S 0 mod I 

The above equations indicate that after a constant has been given as the initial vale of the 
state value S 0l the remainder of I (the number of image regions) relative to the state value 
20 S 0 is P 0 . The value of the position P 0 is any integral value between 0 and (1-1). By 
associating the integral values with the order of the image regions, the message m[0] is 
hidden in the i-th image region. The locations P n of the first and subsequent messages 
m[n] on the basis if the following equations. 



25 



[Equation 2] 
S n = Rnd (SJ 
P n = S n mod I 
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These equations indicate that the preceding state value is used as the seed of a 
random number to generate a pseudo random number array, which is used as the next 
state value S n . The remainder of I relative to the state value S n is the position P n . The 
message m[n] is hidden in the image region corresponding to this value. 

5 

The hidden message can be read only by users who know the state value S 0 (initial value). 
If the message is extracted, all the state values (S„ S 9 ) following the state value S 0 are 
calculated on the basis of the initial value. The positions corresponding to the state values 
are identified and the messages hidden at these positions are extracted. 

10 

As is seen in Fig. 2, since the state value S n used to determine the position on the image 
of the n-th message m[n] depends only on the state value S n . t the state value S n can be 
determined when the state value is determined. Likewise, the state value depends 
only on the state value S n . 2 . By recursively repeating this procedure, all the elements (S 0 , 
15 S 1f S 2 S 9 ) in the state sequence S can be determined. Thus, all the elements in the 
state sequence S depend only on the constant initially given as the initial value. 
Consequently, once the initial value has been identified, the state sequence S can be 
determined to identify all the positions of the messages distributed over the image and to 
extract their hidden contents. 

20 

If the initial value has been published or known to a third person despite efforts to make 
it secret, the third person can use the value to identify the positions of messages easily and 
erase the message data or write a different message data over the first data. It has been 
difficult for conventional methods to effectively prevent a third person from erasing the 
25 original sign to make the source unknown or writing a different sign over the original sign 
to act as if he or she was the author. 
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It is therefore an object of this invention to propose a new method for distributing and 
hiding message data in media data. It is another object of this invention to provide a data 
hiding method that prevents a third person from modifying message data. 

5 SUMMARY OF THE INVENTION 

To achieve the above objects, a first invention relates to a data hiding method of 
dispersively hiding the array elements of a message array in a media array based on a 
state value S specifying a particular array element in the media array if the media data is 
expressed as the media array while message data is expressed as the message array. 
1 o Specifically, the first invention comprises the steps of: 

(a) determining the j-th 0 * 0) state value Sj; 

(b) determining (j + 1 )-th state value S j+1 based on the j-th state value, the array element 
of the media array indicated by the j-th state value, and the array element of the message 

15 array; and 

(c) hiding data with respect to the array element of the media array indicated by the (j + 1 )- 
th state value S j+1 . 

If the message array has J array elements, the above steps (a) to (c) are recursively 
20 repeated until the J array elements have been hidden. If j = 0, the initial state value S 0 is 
determined on the basis of the data in the array elements of the message array. 
Specifically, an initial function for determining the initial state value S 0 is provided, and a 
value based on the data in ail the elements of the message array (for example, the 
exclusive OR of the data) is input to this initial function. 

25 

In step (b), the 0 + 1)-th state value Sj +1 is determined on the basis of the exclusive OR of 
the j-th state value Sj, the data in the array element of the media array indicated by this 
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state value, and the data in the array element of the message array. Specifically, a hiding 
position translation function is provided, and the above exclusive OR is input to this 
function to obtain the (j + 1)-th state value S j+1 as the output. The hiding position 
translation function may be, for example, a function using as a parameter, a secret key 
5 used in public-key cryptography. The hiding data may be the exclusive OR of the array 
element of the media array indicated by the j-th state value Sj and the array element of the 
message array. 

In addition, a second invention relates to a data extraction method of extracting a message 
10 array from a media array based on a state value specifying a particular array element in 
the media array if the message data is expressed as the message array and if hiding data 
including the message data is expressed as a hiding array while the media data in which 
the hiding data is dispersively hidden is expressed as the media array. The second 
invention comprises the steps of: 
1 5 (a) determining the j-th (j 1 1) state value Sj; 

(b) extracting an array elements of the hiding array from the array element of the media 
array indicated by the j-th state value S{, 

(c) determining the (j - 1) state value S H based on the j state value Sj and the extracted 
array element of the hiding array; and 

20 (d) extracting an array element of the message array based on the array element of the 
media array indicated by the (j - 1 )-th state value S H and the extracted array element of the 
hiding array. 

The steps (a) to (c) are recursively repeated until a extraction finish condition has been 
25 met. In step (a), the state value Sj used to start extraction is provided beforehand to an 
extractor as information required for extraction. The state value Sj may be, for example, 
the last state value generated when the message array was hidden. In step (c), the {j - 1 ) 
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state value is determined on the basis of the exclusive OR of the j-th state value Sj and 
the data in an array element of the hiding array. The array element of the hiding array is 
extracted from the array element of the media array indicated by the j-th state value Sj. In 
addition, an extraction position translation function may be provided beforehand, and in 

5 step (c), the exclusive OR of the j-th state value Sj and the data in the array element of the 
hiding array may be input to this function to obtain the 0 - 1 )-th state value S H as the 
output. The extraction position translation function may be, for example, a deciphering 
function using as a parameter, a secret key used in an enciphering key method. In step 
(b), the array element of the hiding array is be the exclusive OR of the array element of the 

1 o media array indicated by the (J - 1 )-th state value S H , the array element of the media array, 
and the array element of the message array. 

In addition, a third invention provides a data hiding method of hiding message data in 
media data, comprising the steps of (a) identifying one block (a pixel block in image data) 

15 in the media data in which the message data is to be hidden, (b) determining the 
characteristic value (for example, a pixel, brightness, or variance value) of the identified 
block, and (c) manipulating the characteristic value of the block so as to hide the message 
data by referencing a translation rule that associates the contents of the data to be hidden 
with the difference between a reference value for the characteristic value and the 

20 characteristic value of the block. 

The reference value may be the characteristic value of another block present in the media 
data. If the first and second blocks are identified in the media data, these blocks are paired 
and the characteristic value of each pair is determined. The characteristic values obtained 
25 are compared and manipulated on the basis of the translation rule (for example, by 
exchanging the characteristic values) so as to hide the message data. If the message data 
comprises a plurality of bits, the steps (a) to (c) are repeated. 
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A fourth invention provides a data extraction method of extracting message data from 
media data in which the message data is hidden, comprising the steps of (a) identifying 
one block in the media data in which the message data is hidden, (b) determining the 
characteristic value of the identified block, and (c) extracting the hidden message data 
according to the characteristic value of the block by referencing a translation rule that 
associates the contents of the data to be extracted with the difference between a reference 
value for the characteristic value and the characteristic value of the block. 

The reference value may be the characteristic value of another block present in the media 
data. If the first and second blocks are identified in the media data, these blocks are paired 
and the characteristic value of each pair is determined. Then, the hidden message data 
is extracted according to the characteristic value of each block by referencing the 
translation rule that associates the contents of data to be extracted with the difference in 
the characteristic values of the blocks. If the characteristic value of the first block is larger 
than that of the second block, this translation rule extracts one of the bits, and otherwise, 
extracts the other bit. 

A fifth invention relates to a system for implementing the above data extraction method. 
That is, the fifth invention provides a data extraction system for extracting message data 
from media data in which the message data is hidden, comprising a translation means for 
translating the media data in which the message is hidden, into digital signals, the media 
data being transmitted as analog signals; an identification means for identifying one block 
in the media data in which the message data is hidden, the media data being transmitted 
as the output of the translation means; a characteristic value calculation means for 
determining the characteristic value of the block identified by the identification means; a 
storage means for storing a translation rule that associates the contents of data to be 
extracted with the difference between a reference value for the characteristic value and the 
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characteristic value of the block; and an extraction means for extracting the hidden 
message data according to the characteristic value of the block. 

Furthermore, a sixth invention relates to a semiconductor integrated circuit with the 
5 functions of the above data extraction system provided on a single chip. That is, the sixth 
invention provides a semiconductor integrated circuit for extracting message data from 
media data in which the message data is hidden, comprising a means for determining a 
characteristic value of one block identified as the one in which the message data is hidden; 
and an extraction means for extracting the hidden message data according to the 
10 characteristic value of the block by referencing a translation rule that associates the 
contents of data to be extracted with the difference between a reference value for the 
characteristic value and the characteristic value of the block. 

15 BEST MODE FOR CARRYING OUT THE INVENTION 

A. Data definition 

First, the following arrays and sequences are defined. 

(1) Media array: M 

(2) Message array: m 
20 (3) State sequence: S 

(4) Position sequence: p 

(1) Media array M 

Media data includes image and sound data. Media data, in which message data is 
25 embedded, is defined with a media array M, and a media array element M[i] that is an array 
element of the media array are expressed as follows. 
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[Equation 3] 

M: {M 0l M, M M,.,} or M[i] 0 <. i s 1-1 I: Length of media data 

If the media data is, for example, an image as shown in Fig. 1 (a), the image is divided into 

5 l image regions with the first one having a 0-th media array value M[0], as shown in Fig. 
3(a). The i-th image region has a media array value M[i], and the last image region has a 
media array value M[l-1 ]. Each media array value has data that is the image information 
of the image region to which it corresponds to. The image information is a density for 
monochrome screens, while it is brightness for color screens. If the number of image 

10 regions is equal to the number of pixels, the media array value M[i] is the pixel value of the 
i-th pixel. If the image region comprises a plurality of pixels, for example, 3 * 3 pixels, the 
image information is respective pixel values. If the media data is sound, the media array 
value M[i] can be defined as a amplitude value at time (i). The data in each array value 
M[i] is assumed to be expressed as a BM byte integer. 

15 (2) Message array (m) 

Message data to be embedded in media data includes, for example, information on the 
producer of the image, management information on a production number, a date, and a 
place, and information on permission for duplicates. Message data is defined with a 
message array (m) and a message array value m[j] that is an array element of the 

20 message array are expressed as follows: 

[Equation 4] 

m: {m 0> m„ .... rfy rrvJ or mQ] OsUJ-1 J: Length of media data 

25 If the message data is, for example, as shown in Figure 3(b), "DATAHIDING" comprising 
1 0 alphanumeric characters, the j-th letter is associated with a message array value m[j-1 ], 
which thus has data indicating the corresponding letter. The data in each array value m[j] 
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is assumed to be expressed a B m byte (1 byte for alphanumeric characters) integer. In this 
case, the message data has a length J of 1 0. 
(3) State sequence S 

The state sequence S is defined to determined the locations (media array values) that are 
5 subjected to hiding, and a state value Sj that is an element of the sequence are expressed 
as follows. 

[Equation 5] 

S: {S 0 , Sn Sj} or Sj 0 s j <; J J: Length of message data 

10 

An algorithm for generating the state sequence S is one of the important factors of this 
embodiment. It should be noted that the number of the elements J of the state sequence 
S is (J + 1), which is larger than the number of the elements J of the message array (m) 
by one. 

15 (4) Position sequence (p) 

The position sequence (p) is used to identify the positions that are subjected to hiding, 
using the following equation. A position pj that is an element of the position sequence (p) 
is expressed as follows. 

20 [Equation 6] 

p: {p 0! Pi, Pj} or ^ 0 <. j * J J: Length of message data 
Pj = Sj mod I 

The position sequence (p) has (J + 1 ) elements as in the state sequence S. The position 
25 pj that is an element of the position sequence is determined as the remainder of I relative 
to the state value with the same index value (j), that is, Sj. Thus, the value of the position 
Pj is an integer between 0 and (I - 1 ), and the image region corresponding to this value is 
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subjected to hiding. Since (I) is an constant that is the number of image regions, the value 
of the position Pj can be uniquely determined once the state value S, has been established. 
Thus, the positions that are subjected to hiding are virtually identified by the state 
sequence S. 

5 

B. Hiding algorithm 

The data hiding method according to this invention is closely related to an algorithm for 
extracting hidden data. That is, a third person who attempts to extract hidden data can 
view the message if he or she has specified information. In this case, it is important to 
1 o effectively prevent the third person from modifying the message during extraction. In view 
of this point, this invention hides data based on the following three characteristics. 

(1) Determination of the state value S 0 based on the characteristics of the message 

(2) Determination of the state sequence S based on the message and image data 

(3) Data to be hidden 

15 

(1) Determination of the state value S 0 based on the characteristics of the message 
The state value S 0 that is the first element of the state sequence is the output of an initial 
function f| N , to which the exclusive OR of all the elements (m[0], m[1j, m[9]) of the 
message array. The state value S 0 and the position p 0 determined on the basis of this 
20 value are determined by the following equations. 

[Equation 7] 

S 0 = fiNi (m[0] XOR m[1] XOR m[2] XOR ... XOR M[n-1]) 
p 0 = S 0 mod I 

25 

Conventional methods have determined the initial state value S 0 as a particular constant 
regardless of the contents of the message to be embedded. According to this 
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embodiment, however, this element is determined on the basis of the contents of the 
message, that is, all the message array values. Fig. 4 schematically shows the distribution 
of message data over an image. The initial position p 0 is identified from the initial state 
value S 0 as shown in the figure. 

5 

The determination of the initial state value S 0 from the characteristics of the message array 
(m) is one of the characteristics of the present algorithm. This not only makes the 
determination of the initial state value complicated but also effectively prevents a third 
person from illegally writing data over the message. The initial function f, N , is also used as 
1 o a function for determining the end of the message during extraction. This prevents a third 
person from writing a different message other than the original message. 

(2) Determination of the state sequence S based on the message and image data 
The next state value S, is determined from the state value S 0 . The state value S 1 is 
1 5 identified by the following equation. 

[Equation 8] 

S, = SK (S 0 XOR m[0] XOR M[pJ) 

20 That is, the next state value S! is determined as the output of a function SK to which the 
exclusive OR of the current state value S 0 , the media array value M[p 0 J indicated by the 
state value S 0 , and the message array value m[0] is input. The media array value M[p D ] is 
the image region at the position p 0 calculated from the state value S 0 . This is true of the 
state value S 2 and subsequent state values, and this relationship can be generally 

25 expressed as the following equation. 

[Equation 9] 
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S j+1 = SK (SjXOR m[fl XOR M[pj]) 

The function SK is a position translation function for determining the next state value and 
receives as an input, the exclusive OR of the current state value, a message array value, 

5 and a media array value. Thus, the state value is sequentially determined by recursively 
executing the above equation. Since (J + 1) state values must be determined, the last 
state value is Sj. If the message array has 10 elements as in Fig. 3(b), 1 1 state values are 
generated. By determining the state sequence S in this manner, all the image regions that 
are subjected to hiding are identified. Fig. 4 shows the relationship among media array 

10 values that are subjected to hiding. 

While a position sequence according to conventional techniques depends only on the initial 
value, the position sequence according to the present algorithm is determined by taking 
the message and media array values into consideration. If a third person attempts to write 
15 a different message over the original message, the overwriting is very difficult because a 
different state sequence is generated. This is also true if a third person attempts to modify 
the image data. 

(3) Data to be hidden 

20 Fig. 5 is a schematic drawing describing an object to be hidden. The position sequence 
(p) that undergoes hiding is assumed to have been identified. First, data is hidden in the 
position p, and subsequent positions, whereas no data is hidden at the position p . 0 
Specifically, the result of the exclusive OR of the media and message array values is 
subjected to hiding. 



First, the media array value M[0] is simply taken from the position p 0 . Then, the exclusive 
OR of the message array value m[0] and the media array value M[0] obtained is 
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determined, and the result is hidden in the position This hiding operation changes the 
contents of the media array value Mfc,]. Furthermore, as the processing at the position 
P 2 , the exclusive OR of the message array value m[1] and the media array value M[1], the 
contents of which have been changed by the preceding processing is determined, and the 
5 result is hidden in the position p 2 . 

Data hiding is completed by repeating such hiding processing until the position p 10 . The 
exclusive OR of the message array value m[9] and the media array value M[9], the 
contents of which have been changed by the preceding processing is hidden in the position 
10 p 10 . The number of the elements of each of the state sequence S and the position 
sequence (p) is larger than that of the message array because no data is hidden in the 
position p 0 . 

According to the present algorithm, if hidden message is extracted, the extraction is 
15 recursively executed in the reverse order starting with the last message array value m[9]. 
The hiding of the results of the exclusive OR of the message and media arrays according 
to the present algorithm is closely related to the message extraction procedure. The 
details are described in the "Message data extraction" section. In addition, various 
algorithms for actually hiding data are conceivable. In a second embodiment, PBC (Pixel 
20 Block Coding) is explained as an example of an algorithm for hiding data. 

C. First embodiment 

A first embodiment is specifically described in conjunction with the procedure in Fig. 6. Fig. 
6 is a flowchart showing a procedure for hiding message data in media data. In the 
25 following description, reference Fig. 4 or 5 as appropriate. Calculation of the initial state 
value (step 100) 

To hide data, the position sequence (p) must be determined. To do this, the initial position 
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p[0] is first determined. The initial position p[0] is determined as the output of the initial 
function f, N , using the initial state value S 0 as an input, and the variable S 0 is determined by 
the following equation. 

5 [Equation 10] 

S 0 = f, N , (m[0], m[1], m[2], .... M[J-1]) 
-H1 (m[0]//m[1]//m[2]//...//M[J-1]) 

In the above equation, H1 is a hash function. In addition, the operator "//" means that the 
10 elements of the message array are connected together. The specific operation may be, 
for example, the exclusive OR of the data in the array elements. If, however, the exclusive 
OR is used, the order of the message array values is not reflected in the results of 
calculation. That is, the "DATAHIDING" in Fig. 3(b) has the same value as 
"TADAHIDING". Then, for example, the method called CRC (Cyclic Redundancy Check) 
15 can be used to reflect the order relationship. This algorithm is one of the algorithms for 
calculating check sums, and generates outputs that depend on the contents and order of 
the data array. 

From an input with a byte length of B m bytes (the array value m[i]), the hash function H1 
20 generates an output with a different byte length K (a hashed value). Since this is a one- 
way function, it is virtually impossible to estimate (x) from (y) when H(x) = y. 

This K byte hashed value is used as the initial state value S 0 for data hiding. The hashed 
value is simply used as an initial value for data hiding, so it must only be ensured that 
25 different outputs results from different inputs. Thus, the hashed value has no particular 
meaning. Importantly, the operation results in the output of a value indicating the 
characteristics of the array, that is, the hashed value is uniquely determined on the basis 
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of the contents of all the array elements and may depend on the contents of the overall 
array. 

If the message data is the "DATAHIDING" shown in Fig. 3(b), the output of the hash 
5 function H1 from the exclusive OR of the data indicating all the alphanumeric characters 
(the data in the array value m[i]) is the state value S„. The remainder if I (the number of 
image regions) relative to the state value S 0 is the position p 0 . This allows the state value 

5 0 and the position p 0 to be obtained as the initial state values. 
Pixel value extraction (step 200) 

10 The image information in the image region corresponding to the value of the position p 0 
obtained in step 100 is determined. If, for example, p 0 = i, this is the media array value 
M[i]. This data is expressed as an B M byte integer. 
Next station calculation (step 300) 

In addition to the state value S 0 determined in step 100, the next state value S, is 
1 5 determined from the media array value M[p 0 ] and the media array value m[0] identified from 
the state value S„. The state value S1 can be determined by the following equation. 

[Equation 11] 

5 1 = SK (S 0 XOR m[0] XOR M[pJ) 
20 (XOR is an exclusive OR operation) 

That is, the exclusive OR of the state value S 0 , the message array value m[0] and the 
media array value M[p 0 ] is determined, and the result is input to the function SK. Then, the 
output of the function SK is the state value S,. This function SK is called a position 
25 translation function, and a position translation function SK for hiding message data in 
media data is particularly called a hiding position translation function. Since the position 
translation function SK is closely related to an extraction position translation function PK 
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for extracting hidden data, it is also described in detail in the following description of the 
extraction position translation function. 

The state value S 2 and subsequent state values can be determined by executing a similar 
5 procedure. To generalize this, the j-th state value S s and position p, can be expressed by 
the following equations. 

[Equation 12] 

Sj = SK (S H XOR my-1]XOR M[p H ]) 
10 pj = Sj mod I 

(SK: hiding position translation function) 

It should be again noted that the position pj depends on the contents of the message and 
media data as well as the preceding state value S H . 

15 

The state value S H , message array value mQ-1], and media array value M[p H ] used as 
inputs to the hiding position translation function SK have different byte lengths: K, B m , and 
B M bytes, respective. It is of course possible to calculate the exclusive OR of inputs with 
different byte lengths, but the exclusive OR is preferably calculated after converting the 
20 inputs so that they has the same byte length. The following two hash functions H2 and H3 
are used to convert the inputs so that all of them have a length of K bytes as in the state 
value. 

H2: a hash function for generating a K byte hashed value from a B m byte integer 
25 H3: a hash function for generating a B m byte hashed value from a B M byte integer 



That is, the hash function H2 is used for the message array value m[j-1] to generate a K 
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byte hashed value from the B m byte integer in the array value. In addition, the hash 
function H3 is used for the media array value M[p H ] to translate the integer with a bit length 
of B M bytes into a B m byte integer. The hash function H2 is further used to generate a K 
byte hashed value. Equation 7 described above includes the case in which inputs have 
5 been converted by the hash functions in this manner. 
Message embedding (step 400) 

The exclusive OR of the message array value mfj-1] and the media array value M[p H ] is 
determined and the data obtained is the hiding array value MmQ-1]. The data in the hiding 
array value Mmfj-1] is hidden in the position p l determined in step 300. The hiding array 
1 o value MmO-1 ] is an array element of the hiding array Mm. 

[Equation 1 3] 

Mm: {Mm[0] f Mm[1], MmQ], .... Mm[J-1]} 
MmQ] = mQ] XOR M[p,] 

15 

It should be appreciated that the hidden data is not the message array value mQ] but the 
hiding array value Mm[j] generated from the exclusive OR of the message array value mfj] 
and the media array value M[pj]. The hiding array value Mm[j-1] is hidden in the position 
Pi by the function X. This changes the data in the media array value M[pJ. The media 
20 array value M[pJ the contents of which have been changed is called M'[pJ. For the specific 
contents of the function X, that is, the algorithm for hiding the hiding array in its specified 
position, PBC is described in the second embodiment as one example. 
Determination (step 5QQ ) 

It is determined whether the value of (j) is equal to J (a value larger than the number of the 
25 elements in the message array by one). If j = J, all the message array values have been 
hidden. If (j) is smaller than J, 1 is added to (j) and the process returns to step 200. Steps 
200 to 400 are then recursively executed until (j) becomes equal to J. This procedure 
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enables a state sequence S and a position sequences each including (J + 1) elements to 
be obtained. 

[Equation 14] 

State sequence S: {S 0 , S 1f S 2 , .... Sj} 
Position sequence P: {p 0 , p„ p 2l .... p d } 

Due to the hiding of the data, the elements of the media array M are changed as follows. 
[Equation 15] 

(Media array prior to hiding) 

M:{M[0], .... M[p 0 ], .... M[ P1 ] M[p„]} 

(Media array after hiding) 

M': {M'[0], M'[Po], M'[ Pl ], .... M'[p M ] - {M[0] M[p 0 ] M'[ Pl ] M'[p M ] 

M' means the media array after hiding, and M'[i] means that the contents of the media array 
value prior to hiding have changed. That is, the media array value M' means that only the 
contents of the J array elements of the original message array M which are identified by 
the state sequence S have changed. Since the array element M[p 0 ] corresponding to the 
state value S 0 is subjected to no change in data, M'[p 0 ] is equal to M[p 0 ]. 

Fig. 7 shows the relationship between the state value S H and S f in hiding. The state value 
Sj can be determined by calculating the exclusive OR of the state value S H and the hiding 
array Mm[j-1] (M'[p H ] XOR m[j-1]) and inputting the result to the position translation 
function SK. The media array value M'[p H ] has already had its contents changed during 
the hiding of the preceding state. In addition, the hiding array Mm[p H ] is hidden in the 
position pj by the function X. This changes the data in that position M[pj], 
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State value extractio n (step 600) 

The hiding of the message is finished by extracting the following two pieces of information. 

(1) Media array M' after hiding 
5 (2) Final state value Sj 

The media array M' is finally obtained by hiding the data in the media data at step 400. 
The final state value Sj is the J-th state value that is calculated last. These information are 
required when extracting the message data as described below. When extracting the 
1 o hidden message, the extractor does not need to know the state values SO to SJ-1 prior to 
(J-1 )-th so long as he or she knows only the J-th state value. This is because these state 
values can be identified by inversely converting the final state value. Of course, the 
extractor must have required information described below. 

15 (Message data extraction) 

The following three pieces of information are provided to a third person to extract message 
data. 

(1) Media array M' after hiding 
20 (2) Final state value Sj 

(3) Extraction position translation function PK 

Information (1) and information (2) are finally generated during the embedding of the 
message data. The extraction position translation function PK in the information (3) is 
25 described in conjunction with the hiding position translation function. 

a. Hiding position translation function SK, extraction position translation function PK 
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As the hiding position translation function SK for generating the next state value has been 
defined in hiding the message, the extraction position translation function PK for generating 
the preceding state value is defined in extracting the message. The hiding and extraction 
position translation functions SK and PK are reverse to each other as shown in the 
5 following equations. 

[Equation 16] 
PK (SK(x)) = x 
SK (PK(x)) = x 

10 

Thus, the initial message array (m) can be extracted by using the hiding position translation 
function SK to convert the message array (m) and then using the inverse function PK to 
further convert the result. 

15 Various functions that meet the above equations are conceivable. To virtually prevent a 
third person from reproducing the hiding position translation function SK using the provided 
extraction position translation function PK, an enciphering function and a deciphering 
function in a public enciphering method are preferably used. There are several algorithms 
according to the public key method, and any of them may be used. By way of example, 

20 the typical RSA method is explained. The algorithm of the RSA method is as follows. 

1 . Two large prime numbers P, Q are selected and calculate n = p * q. 

2. Calculate r = 1cm(p-1 , q-1) and select (d) such that gcd(d, r) = 1 . 

3. Determine (e) such that e * d = 1 (mod r) and 0 < e < r. 

25 4. Publish (e) as a public key and also publish (n). Make (d) secret as a secret key. 

5. To encipher a message (m), calculate (c) such that m e = c(mod n). This (c) becomes 
the encrypted message. 
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6. To decipher an enciphered message, calculate (m) such that c d = m(mod n). This (m) 
is the deciphered message. 

By applying this algorithm to this embodiment, the following hiding and extraction functions 
5 SK and PK are obtained. 

[Equation 1 7] 
SK(m) = x d (mod n) 
PK(m) = x e (mod n) 

10 

That is, a publisher who has produced the image has a secret key (d) and uses it to obtain 
the hiding function SK. The function SK is used to generate the position state sequence 
S for embedding the message data and embeds the data in the media data. A third 
person, who has obtained this distributed data, uses the public key (e) to obtain the 

15 extraction function PK. The third person uses the function PK to generate the position 
state sequence S in order to read the message. In the public key method, the public key 
is published to third people but only the author who has produced the image has the secret 
key, so the third people cannot know the contents of the image. Thus, they cannot know 
the contents of the hiding position translation function. Since the calculation of the secret 

20 key from the public key requires an enormous amount of calculations, it is almost 
impossible for a third person to know the hiding position translation function SK. This 
method can effectively prevent a third person from modifying the original data. 

The important nature of exclusive ORs required to explain the message extraction 
25 procedure is described in brief. Exclusive OR operations have the following nature. When 
the exclusive OR of A and B is determined and the exclusive OR of this result and B is then 
determined, A is reproduced. 
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[Equation 18] 
(AXOR B) XORB = A 

(Extraction algorithm) 

5 Fig. 8 is a flowchart showing a procedure for extracting embedded message data, 
initial state value calculation (step 1 1Q) 

First, the position p i in which the message array value m[J-1] is hidden is determined from 
the final state value Sj that has been provided to the extractor, according to the following 
10 equation. 

[Equation 19] 
Pj - Sj mod I 

15 Hiding value extraction (step 120) 

The media array M'[pJ has the hiding array value Mm[J-1] hidden in the original image 
information in the position pj. The array Mm[J-1] has been determined as the exclusive 
OR of the media array value M'fpj.,] and the message value array m[J-1 J. A function X' is 
then defined to extract the hiding array value Mm[J-1] from the media array value M'[pj]. 

20 An example of the specific contents of the function X' in the following equation is described 
in the second embodiment. 

[Equation 20] 

Mm[J-1] = function X' (M'[pJ) 

25 

Preceding state calcul ation (step 130 ) 

Next, the preceding state value Sj., is determined by calculating the exclusive OR of the 
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state value Sj and the hiding array value Mm[J-1] determined from the function X'. It 
should be noted that the mathematical nature of the above exclusive OR is used in this 
case. 

5 [Equation 21] 

PK (SJ XORX' (M«[ Pj j) 

= (Sj.! XOR Mm[J-1]) XOR Mm[J-1] 

= Sj, 

1 o That is, since the extraction position translation function PK is reverse to the hiding position 
translation function, it can be used to reproduce from the state value Sj, the result of the 
exclusive OR of the state value Sj., and the hiding array value Mm[J-1]. The exclusive OR 
of this result and the hiding array value Mm[J-1] determined in step 1 20 can be determined 
to identify the preceding state value Sj.,. 

15 

Message calculation (step 1 40) 

Once the state Sj.! has been determined, the corresponding media array value M'^] can 
be identified. The message array value m[J-1] can then be extracted using the following 
equation. If the message in Fig. 3(b) has been hidden, this step allows the trailing letter 
20 "G" to be extracted. 

[Equation 22] 
M'IPj-i] XOR Mm[J-1] 
= M'Ipj.,] XOR (M'IRm] XOR m[J-1]) 
25 =m[J-1] 

The extraction of state or message array values can be generally expressed as follows 
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using the index value (j) (1 s j s J). 

[Equation 23] 

5 PK(S,)XORX , (M , [p j ]) 

= (S H XORMmO-1]) XOR Mmfl-1] 
= S,, 

mfl-1]: 

10 M'^] XOR Mmfl-1]) 

= M , [p H ] XOR (M't^lXOR mfl-1]) 
= mfl-1] 

Fig. 9 shows the data relationship indicated by the above equations. This figure shows the 
1 5 relationship between the state value S, and the state value Sj.-, in extraction. A comparison 
of this figure with Fig. 7 shows that the extraction procedure is reverse to the hiding 
procedure. 

Determination (S te p 150 ) 

Each time the message array value m[j] is generated, it is determined whether it is the 
20 beginning of the message array. If so, all the elements of the message array (m) have 
been extracted. Since the message array (m) is extracted in the reverse order starting with 
the last element, the message is extracted in the order of "GNIDIHATAD" in the example 
of Fig. 3(b). At step 160, the order of the message is reversed to generate a complete 
message. If the message array value mfl] is not the beginning of the message array, steps 
25 120 to 140 are recursively executed until the result of the determination at step 140 
becomes positive. 
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The determination of whether or not an element of the message array (m) is the beginning 
is equal to the determination of whether the generated state value Sj matches the output 
of the hash function H1 . The exclusive ORs of all the elements of the message array (m) 
which have already been generated are input to the hash function H1 . If the state value 
5 Sj, which has been generated in the reverse order, and the output of the hash function H1 
has the relationship shown in the following equation, the value of j is zero (see Equation 
10). 

[Equation 24] 

10 Sj - f (m[J-1 ] XOR m[J-2] XOR ... XOR m[j] = S 0 

The above equation is established only when j = 0 and not in the other cases (j / 0). Each 
time the final state value Sj is generated in the reverse order, it is compared to the output 
of the hash function based on all the extracted message array values. Fig. 10 describes 

15 the determination of whether the generated message array value is the beginning of the 
message. First, the state value S M preceding the final state value Sj is determined and 
the message array value m^ is determined. The output F of the initial function (f) obtained 
when the message array value m H has been input does not match the state value Sj.,. If 
the state value S 0 is determined, it matches the output F of the initial function and the 

20 message array values is determined to be the beginning. Thus, due to the following 
characteristics, the present algorithm s very useful in preventing a third person from 
modifying the message. 

(1) Prevention of the overwriting of a different message 
25 According to conventional techniques, the generated position sequence S has depended 
only on an initially provided constant regardless of the contents of the message. By simply 
knowing the constant, third people have been able to erase the message or overwrite a 
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different message over an original message. According to the present algorithm, however, 
the generation of the state sequence also depends on the contents of the message, 
different messages result in different position sequences. Since the hidden position is 
based on the position sequence, a different message with different contents cannot be 
5 hidden in a position at which the original message is present. This is also true of media 
data. 

(2) Prevention of the hiding of a different message 

The present algorithm checks the condition shown in Equation 24 to determine the end of 
10 extraction. This condition is met only when the contents of an extracted message are the 

same as those of the original message and when the extracted message has the same 

length as the message data (J). Extraction is not finished unless this requirement is met. 

Thus, if an attempt is made to hide data other than the original message in the reverse 

order based on the final state value Sj, the requirement in Equation 24 is never met and 
15 the calculation continues permanently. Consequently, attempts to hide a message in 

positions other than those in which the original message is present fail because the 

calculation is not finished. 

If a third person has modified the message array values as shown in Fig.s 1 0 and 1 1 , a 
20 different state array S' is generated on the basis of the modified message. This prevents 
the third person from erasing or overwriting the original message located in positions other 
than those positions identified by the state sequence S\ The state sequence S' generated 
by modification as shown in Fig. 1 1 never allows the determination that extraction has been 
finished. Since the writing of different messages can be virtually prevented, modification 
25 by third people can be effectively prevented. 

D. Second embodiment 
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This section describes Pixel Block Coding (hereafter referred to as "PBC") that is one of 
the methods of embedding in media data to be hidden and extracting the hidden data. 
With PBC, data hiding and extraction can be processed according to the translation rule 
described below. 

5 

(Basic Algorithm) 

The primary characteristics of the pixel values of two adjacent pixels generally have a high 
correlation. Thus, even if the pixel values are exchanged between adjacent pixels, the 
image will not be visibly degraded. In view of this nature, the present algorithm defines an 
10 image region with at least one pixel as a pixel block, and hides one bit data by intentionally 
exchanging the characteristic values of adjacent pixel blocks based on a particular 
translation rule. That is, data is expressed by exchanging the characteristic values of 
adjacent pixel blocks. In addition, data is extracted according to an extraction rule 
determined by this translation rule. 

15 

Bit information is expressed by exchanging the characteristic values (for example, 
brightness) of two adjacent pixel blocks according to the following translation rule. 

Bit on <1 >: the characteristic value of one pixel block (PB,) is larger than the characteristic 
20 value of the other pixel block (PB 2 ) 

Bit off <0>: the characteristic value of one pixel block (PB,) is smaller than the 
characteristic value of the other pixel block (PB 2 ) 

In addition, bit information is extracted by comparing the characteristic values (for example, 
25 brightness) of two adjacent pixel blocks according to the following extraction rule. 

The characteristic value of one pixel block (PB,) is larger than the characteristic value of 
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the other pixel block (PB 2 ): bit on <1 > 

The characteristic value of one pixel block (PB,) is smaller than the characteristic value of 
the other pixel block (PB 2 ): bit off <0> 

5 Fig. 12 describes data hiding and extraction using PBC. The pixel blocks PB1 and PB2 
may be defined as a set of a plurality of pixels, for example, 3x3 pixels or one pixel may 
be defined as a one pixel block. Since adjacent pixel blocks have a high correlation, the 
exchange of their positions does not visually degrade the image (Fig. 12a). 

10 The positions of the pixel blocks in the original image are assumed to be as shown in Fig. 
12(b). It is also assumed that as the result of a comparison of the characteristic values of 
two pixel blocks, the characteristic value of PB 1 is larger than that of PB 2 . If data "1" is 
hidden in the original, the characteristic values of the pixel blocks have already met the 
condition for the data "1" in the translation rule and are thus not exchanged. When the 

15 data is extracted, the data "1 " is extracted because the extraction rule specifies that this 
be done if the characteristic value of PB1 is larger. 

On the other hand, if data "0" is hidden in the original, the characteristic values of the pixel 
blocks of the original are exchanged because the relationship between these characteristic 
20 values does not meet the condition for the data "0". This exchange, however, cannot be 
visually recognized. During extraction, the data "0" is extracted on the basis of the 
relationship between the characteristic values of these pixel blocks. 

In this manner, PBC selects from the image a sufficient number of pixel blocks to hide 
25 information to be hidden. One of the selected pixel blocks and the adjacent pixel block are 
then paired to generate an array of such pairs. The pairs are sequentially hidden starting 
with the beginning of the array. 
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This array may be associated with the state sequence S in the first embodiment. For 
example, a pixel block is associated with the array element (m) of the media array M in the 
first embodiment. Each array element (state value S) sequentially generated during hiding 
and the adjacent media array value are paired. Pairs obtained are then subjected to the 
5 above processing. It is also possible to make such determinations based on a pseudo 
random number sequence generated from particular seeds of random numbers. 

During extraction, the same block array as in hiding is scanned. The overall image is 
extracted by collecting one bit after another according to the extraction rule by determining 
10 whether each pair represents "bit on" or "bit off". If the characteristic values of the paired 
pixel blocks are the same, this pair is skipped as in hiding. By making the block array or 
the array generation method secret, the hidden information can be hidden from third 
people. 

15 In PBC, the embedding positions are preferably determined in view of image quality and 
extraction accuracy. That is, if the difference in the characteristic value between the paired 
pixel blocks is too large, image quality may be degraded by the exchange operation. To 
restrain such degradation of image quality, a first threshold (an upper limit) is preferably 
provided such that if the difference in the characteristic value is equal to or larger than the 

20 threshold, no bit is embedded in that pair. 

In addition, if the difference in the characteristic value is sufficiently small, image quality is 
not substantially degraded by the exchange operation. In this case, however, noise may 
reverse the magnitude of the characteristic values, thereby preventing the embedded bit 
25 from being extracted. Thus, to restrain such a decrease in extraction accuracy, a second 
threshold (a lower limit) is preferably provided such that if the difference in the 
characteristic value is equal to or smaller than the threshold, no bit is embedded in that 
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pair. 

Those pairs which correspond to these cases are skipped without being subjected to 
operations. The bit information to be hidden is then passed to the next pair. 

5 

(Block characteristic values) 

The values of the primary and secondary characteristics of pixel blocks can be used as the 
characteristic values. The primary characteristics are direct pixel value parameters such 
as the brightness and chromaticity of pixel blocks. The secondary characteristics are those 

10 values such as the mean and variance of the above parameters which indicate statistical 
natures, and can be obtained by analyzing the primary characteristics. Furthermore, the 
characteristic value may be the result of an operation with an array comprising a plurality 
of pixel values and a specified array (mask) or a specified element value obtained by 
frequency conversion. The primary characteristics exhibit a high correlation between two 

15 adjacent pixel blocks. The secondary characteristics, however, exhibit a high correlation 
between two separated blocks that are not adjacent to each other. It should thus be noted 
that pixel blocks that are subjected to PBC are not necessarily limited to adjacent blocks. 
The brightness value that is a primary characteristic and the variance value that is a 
secondary characteristic are described below as examples of the characteristic value of 

20 pixel blocks. 

First, the use of the brightness value as the characteristic value of pixel blocks is 
described. If a single pixel is associated with a pixel block, the brightness value of this 
pixel can be directly used as the characteristic value of this block. In natural images, 
25 adjacent pixels often have a high correlation, so image quality is not substantially degraded 
even if they are exchanged. Fig. 13 describes hiding with PCB executed when a pixel 
block comprises a single pixel. 
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Next, the use of the variance value as the characteristic value is described. When a pixel 
block comprises n x m pixels, image quality may be significantly degraded, for example, 
a striped pattern may occur on the image if the brightness values of pixels are exchanged 
between blocks. Thus, it is not preferable to directly use the pixel value as the 
5 characteristic value of blocks. Consequently, the use of the variance value of pixel 
brightness as the characteristic value is contemplated. 

It is known that when the nature of the brightness value of a pixel block is divided into a 
mean value (h) and a variance value (d), image quality is not substantially degraded if the 
10 variance values (d) are exchanged between adjacent pixel blocks with the mean values 
(h) remaining unchanged. Thus, based on this nature, the data can be hidden by using the 
variance value (d) as the characteristic value of pixel blocks and exchanging these values 
between adjacent blocks according to the translation rule. 

15 It is assumed that the pixel block PB, has a mean value h, and a variance value d, while 
the pixel block PB 2 is assumed to have a mean value h 2 and a variance value d 2 , as shown 
in Fig. 12(c). If the bit "1" is hidden, the condition for this bit is not met because d, < d 2 . 
Then, only the variance values (d) are exchanged between the two pixel blocks. This is 
equal to the exchange of the crests of the distributions between the two pixel blocks with 

20 their mean values (h) remaining unchanged. 

(Amount of information that can be hidden) 

In PBC, the upper limit of the amount of information that can be hidden is determined as 
follows using the sizes of the image and the pixel block. 

25 

(Image size)/(pixel block size)/2 [bit] 
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maintained. Thus, even if a hue adjustment or a correction is carried out, the hidden 
information is expected to be maintained on the basis of the difference in the characteristic 
value. In addition, if, in the variance value exchange method described above, the pixel 
block has a size of 8 * 8, the hidden information can be extracted accurately even after 
5 JPEG compression processing. Our experiment showed that 90% of information was 
maintained even after "compression with loss" was carried out so as to reduce the file size 
down to 5%. The variance value exchange method is expected to effectively maintain the 
hidden information even after D/A and A/D conversions such as print/scan operations. 

10 (Extension of PBC) 

It should be noted that the above PBC is only illustrative and that many other methods are 
conceivable. As apparent from the above embodiment, the manipulation of the 
characteristic value of pixel blocks according to the rule on the difference between 
characteristic values is important in data embedding and extraction. In this sense, in 

1 5 addition to the exchange of characteristic values as described above, a specific value may 
be added to one of the characteristic values or subtracted from the other (or both 
manipulations may be executed simultaneously). In this case, the specified value may be 
a constant or be varied depending on the conditions of the pixel blocks to be processed. 
In addition, as an extension of PBC, a rule for associating the signs of characteristic values 

20 with binary information may be defined to embed or extract data according to this rule. 

The essence of this invention is to use a certain reference value (for example, one of 
adjacent pixel blocks in image data) to manipulate the other pixel block according to a 
meaningful rule. That is, data is hidden by manipulating the characteristic value of one of 
25 adjacent pixel blocks by referencing a translation rule that associates the contents of the 
data to be hidden with the difference between a reference value for the characteristic value 
and the characteristic value of this block. In this sense, according to this invention, the 
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If, for example, 1 * 1 pixel blocks are applied to a 384 * 256 image, the upper limit of the 
amount of information that can be hidden is 6 Kbytes. The actual limit, however, is smaller 
because all the pixel block cannot be used for hiding as in the case in which adjacent pixel 
blocks have the same characteristic value. In addition, this amount may be further reduced 
5 because despite the possibility of hiding, the exchange processing is occasionally avoided 
to prevent image quality from being degraded. 

(Image preservation and degradation) 

Two adjacent pixels on an image with an edge feature extending between the pixels are 
10 known to have significantly different brightness values. Thus, the exchange of such two 
pixels may break the edge feature, resulting in the obvious degradation of image quality. 
Thus, to prevent the degradation of image quality, a certain threshold is provided for the 
brightness values to be exchanged. If this threshold is exceeded, this pair can be 
effectively skipped without exchanging the characteristic value. The threshold may be 
15 determined on the basis of a variance value calculated from the image data or, local 
variance values around the blocks. 

If the variance value of a block which is close to zero is exchanged with the variance value 
of another block which is far from zero, the small block is significantly changed, resulting 
20 in the obvious degradation of image quality. Thus, the smaller variance value may be 
compared to the threshold, and if it is smaller than the threshold, the exchange operation 
may be avoided. 

(Endurance of PBC) 

25 Since the characteristic values of adjacent pixel blocks are compared and the data 
associated with the exchange of the characteristic values is hidden, the hidden information 
can be retrieved correctly so long as the relative relationship between the blocks is 
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reference need only be clear and able to manipulate characteristic values, and need not 
necessarily be a specified region (a pixel block) in image data. Thus, the reference may 
be obtained from data other than image data. For example, a fixed mask pattern with a 
specified value (a reference value) and the same size as a pixel block may be used as a 
5 reference for characteristic value manipulation. In this case, during hiding, the 
characteristic value of a specific pixel block may be manipulated relative to the reference 
value in the mask pattern, and during extraction, data may be extracted on the basis of the 
difference from the reference value. 

10 E. Third embodiment 

This section describes a method of hiding information the reference position. The above 
data hiding technique selects and processes pixel blocks from an original image and 
extracts a message from the processed pixel blocks. Thus, the positional information on 
pixel blocks is essential in extracting messages. The positions of pixel blocks are relatively 

15 identified using a certain region in the image (in the first embodiment, the upper left corner 
of the original image) as a reference. If, however, a third person edits the image, for 
example, cuts off part of the image, the reference position of the image cannot be 
identified, thereby preventing the message from being extracted. Referencing Fig. 3(a) 
again, if part of the original image (the area enclosed by the broken line in th figure) is cut 

20 off, the original reference position (the position M[0] of the 0-th array element in the 
message array) cannot be determined from the remaining screen. 

The important requirements for data hiding are that hidden information cannot be removed 
easily by a third person or can be extracted correctly even if a third person has maliciously 
25 modified the data or if the data has been subjected to image compression with loss such 
as JPEG. Then, in addition to a message, information for identifying the reference position 
is preferably hidden in the original image as shown in Fig. 14. Hiding such positional 
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information enables the message to be extracted correctly even from a modified or 
compressed image. Thus, information for identifying the reference position for media data 
is also hidden in the message data. This information on the reference position is hidden 
all over the message data. If part of the message data has been cut off, this information 
5 enables the reference position for the message data or any position relative to the 
reference position to be detected from the partial message data that has been cut off. For 
example, concentric circular arcs around the reference position may be hidden all over the 
image, as described below. 

10 (Use of concentric circular arcs as positional information) 

Concentric circular arcs as shown in Fig. 15 are used as positional information. These 
concentric circular arcs are drawn at a specified interval around the upper left corner (the 
reference position) of the original image. To correctly extract the reference position from 
a partial image that has been cut off, at least one circular arc must be included in a partial 

15 image assumed to be cut off. Consequently, the interval among the concentric circular 
arcs is set assuming the above conditions. 

The reference position can be identified using the nature of a circle "if a circle is drawn 
using a point on the circumference of another circle as the center in such a way that both 

20 circles have the same radius, the drawn circumference will pass through the center of the 
second circle". Fig. 16 describes the identification of a reference position B in the case in 
which concentric circular arcs are used positional information. First, any three points (a,, 
a 2 , a 3 ) present on three concentric circular arcs (C^ Ca, CJ, respectively, present in the 
partial image that has been cut off are identified. Next, circles with different radii (r 1f r 2 , r 3) 

25 r 4 , ...) are drawn from the respective points. In this case, one of the intersections of the 
circles at which the largest number of circles intersect one another is identified as the 
reference position B. Since, in digital images, pixels are diversely arranged like a lattice, 
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the reference position can be identified very accurately by drawing circles as described 
above. 

For circles (with larger radii) remote from the upper left corner (the reference position), the 
5 interval between the circular arcs may be reduced in such a way that the partial image 
includes two or more circular arcs. As the radius of a circular arc becomes larger and 
larger, the circular arc contains more and more linear elements, resulting in large 
differences in the calculated values of the reference position. Reducing the interval 
enables such differences to be reduced because the partial image includes a large number 
10 of circular arcs. In addition, by setting the reference position, that is, the center of 
concentric circular arcs, in both upper left and lower right corners, the size of the original 
image can be determined from these two centers. 

(Hiding with concentric circular arcs) 

15 Fig. 17 describes hiding and extraction methods that use concentric circular arcs as 
positional information. In Fig. 17(a), a message is hidden in the original image using the 
method shown in the second embodiment, and concentric circular arcs are subsequently 
hidden therein as positional information. For example, the LBP of the pixels on the 
concentric circular arcs is set at "1 ". LBP is used because changes in this value cause few 

20 visible changes in the image. In addition, in Fig. 17(b), positional information is hidden in 
the original image, and a message is subsequently hidden therein. 

(Extraction with concentric circular arcs) 

A two-dimensional vote array T is created. The size of the array is determined by 
25 considering the maximum size of a partial image expected to be cut off. If, for example, 
the array has a width of (2m -1), it can deal with partial images 1/m times as large as the 
original image at maximum. Each of the elements of the vote array T represents a 
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candidate in the reference position B, and the value of the element represents the number 
of vote for the candidate. Both coordinates are associated with each other in such a way 
that the partial image is located in the center of the vote array. 

5 The partial image is scanned, and when a point with an LBP of "1 " is encountered, circular 
arcs of all the known radii on the vote array are drawn around this point. The number of 
votes for each element of the vote array T is then increased by one. In this case, if this is 
the point on the circumference at which the point corresponding to each element is hidden, 
at least one of the drawn circular arcs passes through the original center. 

10 

The partial image is alternatively scanned vertically and horizontally, and the above 
procedure is carried out each time a point with an LBP of "1 " is encountered. This allows 
the position corresponding to the element with the largest number of votes to be identified 
as the reference point B. The largest number of votes concentrates on the array element 
15 of the vote array which corresponds to the reference position. Thus, the present algorithm 
enables the reference position to be identified correctly despite the effects of noise. The 
message can then be extracted appropriately from the partial image based on the identified 
reference position. 

20 Since the method in Fig. 17(a) changes image data with a message hidden therein, the 
extraction of the information may fail unless a data hiding method resistant to random noise 
is used. The method in Fig. 17(b), however, does not change the hidden message until 
it is extracted, so it is more resistant to noise than the method in Fig. 17(a). As a result, 
this method allows various data hiding methods to be used. It should be noted, however, 

25 that part of the information on the reference position according to this invention may be 
broken. 
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F. Specific applications 

Specifically, data hiding and extraction methods using the above algorithm are applicable 
to the following system. 

5 (Television image CM occurrence count) 

Whether a commercial is broadcasted a requested number of times is important to the 

person who has ordered the broadcasting. The present algorithm can be used to construct 

a system as shown in Fig. 18 in order to automatically count the number of broadcasts. 

The broadcast station has a storage apparatus, an encoder, and a broadcasting apparatus. 
10 The storage apparatus stores a commercial image, and the encoder hides in the 

commercial image, information used to count the number of broadcasts. The commercial 

image with this information hidden therein is broadcasted by the broadcasting apparatus. 

This system uses the above data hiding method to encode the commercial image. 

Encoding is carried out based on the data in the counter information, which is dispersively 
15 hidden in the commercial image. In addition, a receiving station has a receiver, a decoder, 

and a counter. The receiver receives the commercial image with the counter information 

hidden therein. The decoder extracts the counter information from the commercial image. 

Based on the extracted counter information, the counter counts the number of broadcasts. 

Such a reception system extracts the counter information based on the above data 
20 extraction method. Decoding is carried out by identifying and extracting data from those 

positions on the commercial image at which the counter information is distributed, based 

on the data in the counter information. 

The broadcast station broadcasts the commercial image with the hidden counter 
25 information that enables the number of broadcasts to be counted at the receiving station 
based on the hiding algorithm. Therefore, the receiving station can count the number of 
broadcasts by extracting this counter information. 
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(Censorship at a firewall) 

All the HTTP transmissions inside and outside a topology such as Firewall flow through a 
proxy located at the topology. This nature can be used to censor images and sound data 
on the proxy. If media data with a water mark hidden therein is detected, it is left in a log. 

5 This enables illegally distributed data to be detected. Fig. 1 9 is a block diagram of such 
a system. A server has a storage means in which media data is stored and an encoder 
for hiding message data in the media data. The server controls the encoder so as to hide 
message data in the media data read from the storage means, and transmits the output 
data to Internet. The above hiding method is then used to dispersively hide the message 

1 o data in the media data based on the contents of the message data. A receiving station has 
a proxy that receives from Internet the media data that has been subjected to the hiding 
processing at the transmitting station, and a decoder for extracting the message data from 
the received data. Extraction is executed by using the above data extraction method to 
identify the positions of the distributed message data based on the contents of the 

15 message data. 

(Application to travel agency servers) 

Additional information such as the description or map of a sight-seeing place, or point 
information such as URL is hidden in a photograph of the sight-seeing place displayed on 

20 a WWW server operated by a travel agency. By using the WWW Browser to network-copy 
the photograph, the map can be extracted later from the photograph to check the 
description, transportation, and route. As shown in Fig. 20, the additional information can 
be extracted even after the client has been disconnected from the WWW server. 
Compared to the method of separating a file for additional information from a file for photo 

25 data, the use of hiding enables the close relationship between the photo data and the 
additional information to be maintained, thereby allowing the data to be filed easily. 
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(Finger printing and water marking) 

Finger printing refers to the hiding in media data to be issued to a third person, of a mark 
that enables the issuer of the media data to identify that person. If anyone conducts an 
illegal act such as illegal copying, finger printing enables the source of the act to be 
5 identified. Thus, if this person is illegally distributing the copy, this illegal copy can be 
charged on him or her. 

On the other hand, water marking refers to the hiding in media to be issued to a third 
person, of a mark that the issuer of the data to identify him- or herself. This ensures that 
1 o this data has not been modified during distribution, that is, has been issued from the legal 
issuer and not modified during distribution. 

To hide such marks, a system such as shown in Figure 21 can be constructed. When data 
is issued to a third person, a mark is hidden in the data. To detect the mark, it is extracted 
1 5 from the data obtained from the third person. 

In addition to the above systems, this invention is applicable to those systems which use 
cable or satellite communication or storage media such as DVD-ROMs (DVD-RAMs). In 
particular, if a DVD-ROM (DVD-RAM) is used to distribute media data, the data includes 
20 hidden information on a permission for copying, that is, whether copying is prohibited or 
allowed. DVD players used by end users to duplicate the media data include a function 
for limiting copying. Thus, if the copying player extracts a copy permission condition from 
the medium to find that copying is prohibited, it operates to prohibit copying. 

25 G. Data extraction systems and semiconductor integrated circuits implementing such 
systems 
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Specifically, the above data extraction method can be implemented on systems of the 
following configuration. In addition, most of the functions of such a system can be provided 
on a single chip as a semiconductor integrated circuit. The detailed description of the 
characteristics of such systems and semiconductor integrated circuits is omitted to avoid 
5 duplication, but the items and their extensions and variations described in the data 
extraction method are also applicable to these systems. The details of the following items 
will be obvious to those skilled in the art from the description of the above methods. 

Specifically, a system for extracting message data from media data has a translation 
1 o means, an identification means, a characteristic value calculation means, a storage means, 
and an extraction means, as shown in Figure 22. The translation means is an A/D 
converter that converts message data comprising analog signals in which message data 
is hidden, into digital signals, which are then output by the converter. The identification 
means identifies one of those blocks in media data output by the translation means in 
15 which message data is hidden. The characteristic value calculation means determines the 
characteristic value of the block identified by the identification means. The storage means 
stores a translation rule that associates the contents of the data to be extracted with the 
difference between a reference value for the characteristic value and the characteristic 
value of this block. The extraction means references the translation rule to extract the 
20 hidden message data depending on the characteristic value of this block. 

The reference value is the characteristic value of a second block that is present in the 
media data and that is different from the first block. The translation rule stored in the 
storage means states that one of the bits is extracted if the characteristic value of the first 
25 block is larger than that of the second block whereas the other bit is selected if the 
characteristic value of the first block is smaller than that of the second block. 
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In addition, a semiconductor integrated circuit comprising the above system incorporated 
in a single chip has, as shown in Fig. 23, at least a calculation means and an extraction 
means. The calculation means determines the characteristic value of that block in media 
data comprising input signals which is identified as the one in which message data is 
5 hidden. The extraction means extracts the hidden message according to the characteristic 
value of the block by referencing the translation rule that associates the contents of the 
data to be extracted with the difference between a reference value for the characteristic 
value and the characteristic value of this block. 

10 INDUSTRIAL APPLICABILITY 

As described above, if message data is dispersively hidden in media data such as an 
image or sound, this invention determines those positions in which the message is to be 
hidden based on the contents of both media and message data. Therefore, this invention 
• enables data hiding that prevents a third person from modifying the message easily. 
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CLAIMS 

The embodiments of the invention in which an exclusive property or privilege is claimed are 
defined as follows: 



1 1. A data hiding method in which media data is expressed as a media array while 

2 message data is expressed as a message array and in which the elements of said message 

3 array are dispersively hidden in said media array based on a state value specifying a 

4 particular array element of said media array, comprising the steps of: 

5 (a) determining the j-th 0 * 0) state value; 

6 (b) determining + 1 )-th state value based on said j-th state value, the array element 

7 of said media array indicated by the j-th state value, and the array elements of said 

8 message array; and 

9 (c) hiding data with respect to the array element of said media array indicated by said 

10 (j + 1)-th state value. 

1 2. A data hiding method in which media data is expressed as a media array while 

2 message data is expressed as a message array and in which the J elements of said 

3 message array are dispersively hidden in said media array based on a state value 

4 specifying a particular array element of said media array, comprising the steps of: 

5 (a) determining the j-th (j 2 0) state value; 

6 (b) determining (j + 1 )-th state value based on said j-th state value, the array element 

7 of said media array indicated by the j-th state value, and the array elements of said 

8 message array; 

9 (c) hiding data with respect to the array element of said media array indicated by said 

10 (j + 1 )-th state value; and 

11 (d) recursively executing steps (a) to (c) to hide the J elements of said message 
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array. 

1 3. A data hiding method according to Claim 1 or 2 wherein if j = 0 in said step (a), the 

2 initial state value is determined on the basis of the data in the array elements of said 

3 message array. 

1 4. A data hiding method according to Claim 1 or 2 wherein if j = 0 in said step (a), the 

2 exclusive OR of the data in all the array elements of said message array is used as an input 

3 to said initial function, the output of said initial function being used as said initial state value. 

1 5. A data hiding method according to Claim 1 or 2 wherein in said step (b), said 0 + 1 )- 

2 th state value is determined on the basis of the exclusive OR of said j-th state value, the 

3 data in the array element of the media array indicated by the j-th state value, and the data 

4 in the array elements of said message array. 

1 6. A data hiding method according to Claim 1 or 2 wherein in step (b), the exclusive OR 

2 of said j-th state value, the data in the array element of said media array indicated by the 

3 j-th state value, and the data in the array elements of said message array is used as an 

4 input to said position translation function, the output of said position translation function 

5 being used as said (J + 1 )-th state value. 

1 7. A data hiding method according to Claim 6 wherein said hiding position translation 

2 function is an enciphering function that uses as a parameter, a secret key used in the public 

3 key method. 

1 8. A data hiding method according to Claim 1 or 2 wherein, in said step (c), said hiding 

2 data is the exclusive OR of the array element of said media array indicated by said j-th state 
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3 value and the array elements of said message array. 

1 9. A data hiding method according to Claim 1 or 2 wherein, in said step (c), one of the 

2 pixel blocks which corresponds to the array element of said media array indicated by said 

3 (j + 1 )-th state value and a second block are paired and wherein the characteristic values 

4 of the pixel blocks are manipulated based on the translation rule so as to hide the hiding 

5 data. 

1 1 0. A data hiding method according to Claim 9 wherein said second block is adjacent to 

2 said first block constituting a pair with the second block. 

1 11. A data hiding method according to Claim 9 wherein said translation rule specifies a 

2 rule used to manipulate said characteristic values based on the difference in the 

3 characteristic value between the respective pixel blocks constituting a pair. 

1 12. A data hiding method according to Claim 1 1 wherein said characteristic value is the 

2 brightness value of said pixel block. 

1 1 3. A data hiding method according to Claim 1 1 wherein said characteristic value is the 

2 variance value of said pixel block. 

1 1 4. A data hiding method according to Claim 2 further including the step of (d) extracting 

2 said media array in which said hiding data is hidden; and the last state value. 

1 1 5. A data extraction method in which message data is expressed as a message array, 

2 in which hiding data including said message data is expressed as a hiding array while 

3 media data with said hiding data dispersively hidden therein is expressed as a media array, 
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4 and in which said message array is extracted from said media array based on a state value 

5 specifying a particular array element of said media array, comprising the steps of: 

6 (a) determining the j-th (j * 0) state value; 

7 (b) extracting an array element of said hiding array from the array element of said 

8 media array indicated by said j-th state value; 

9 (c) determining the (j - 1)-th state value based on the j-th state value and said 

10 extracted array element of the hiding array; and 

1 1 (d) extracting an array element of said message array based on the array element 

12 of said media array indicated by said (j - 1 )-th state value and the extracted array element 

13 of said hiding array. 

1 1 6. A data extraction method in which message data is expressed as a message array, 

2 in which hiding data including said message data is expressed as a hiding array while 

3 media data with said hiding data dispersively hidden therein is expressed as a media array, 

4 and in which said message array is extracted from said media array based on a state value 

5 specifying a particular array element of said media array, comprising the steps of: 

6 (a) determining the j-th (j * 1 ) state value; 

7 (b) extracting an array element of said hiding array from the array element of said 

8 media array indicated by said j-th state value; 

9 (c) determining the (j - 1)-th state value based on the j-th state value and said 

I o extracted array element of the hiding array; 

I I (d) extracting an array element of said message array based on the array element 

12 of said media array indicated by said (j - 1 )-th state value and the extracted array element 

13 of said hiding array; and 

14 (e) recursively executing steps (a) to (c) until an extraction end condition is met. 

1 17. A data extraction method according to Claim 15 or 16 wherein in said step (a), the 



JA9-96-044 



50 



2 first state value used in starting extraction is provided to an extractor as information required 

3 for extraction. 

1 1 8. A data extraction method according to Claim 1 7 wherein the first state value used 

2 to start extraction is the last state value generated in hiding said message array. 

1 1 9. A data hiding method according to Claim 1 5 or 1 6 wherein in step (c), said (j - 1 )-th 

2 state value is determined on the basis of the exclusive OR of said j-th state value and the 

3 data in the array element of said hiding array extracted from the array element of said media 

4 array indicated by the j-th state value. 

1 20. A data hiding method according to Claim 15 or 16 wherein an extraction position 

2 translation function is provided, and wherein in said step (c), the exclusive OR of said j-th 

3 state value and the data in the array element of said hiding array extracted from the array 

4 element of said media element indicated by said j-th state value, the output of said position 

5 translation function being used as said (j - 1 )-th state value. 

1 21. A data extraction method according to Claim 20 wherein said extraction position 

2 translation function is an deciphering function that uses as a parameter, a public key used 

3 in the public key method. 

1 22. A data extraction method according to Claim 1 5 or 1 6 wherein, in said step (b), the 

2 array element of said hiding array is the exclusive OR of the array element of said media 

3 array indicated by said (j - 1 )-th state value and the array elements of said message array. 

1 23. A data hiding method according to Claim 1 5 or 1 6 wherein, in said step (b), the pixel 

2 block corresponding to the array element of said media array indicated by said j-th state 
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3 value and a second block are paired and wherein the hidden data is extracted according 

4 to the relationship between the characteristic values of said pixel blocks constituting a pair 

5 on the basis of an extraction rule. 

1 24. A data extraction method according to Claim 23 wherein said second block is 

2 adjacent to said first block constituting a pair with the second block. 

1 25. A data extraction method according to Claim 23 wherein said extraction rule is based 

2 on the difference in the characteristic value between said pixel blocks constituting a pair and 

3 corresponds to the translation rule that specifies the manipulation of said characteristic 

4 values during hiding. 

1 26. A data extracting method according to Claim 25 wherein said characteristic value is 

2 the brightness value of said pixel block. 

1 27. A data hiding method according to Claim 25 wherein said characteristic value is the 

2 variance value of said pixel block. 

1 28. A data extraction method according to Claim 1 6 wherein an initial function used in 

2 hiding data in order to determine the initial state value based on the data in the array 

3 elements of said message array, and wherein the method further includes the steps of: 

4 (f) in said step (d), each time an element of said message element is extracted, using 

5 the data in all the array elements of said message array as an input to determine as said 

6 extraction end condition that the output of said initial function is equal to said Q - 1 )-th state 

7 value. 

1 29. A data extraction method according to Claim 28 wherein the exclusive OR of the data 
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2 in all the extracted array elements of said message array is used as an input to said initial 

3 function. 

1 30. A data extraction method according to Claim 27 wherein in step (f), the end of 

2 extraction is recognized when the output of said initial function is equal to said (j - 1 )-th state 

3 value. 

1 31 . A broadcast system comprising: 

2 a storage apparatus in which a commercial image is stored; 

3 an encoder for hiding in said commercial image, information used to count the 

4 number of broadcasts of said commercial image; and 

5 a broadcast apparatus for broadcasting said commercial image with said information 

6 hidden therein, wherein: 

7 said information is dispersively hidden in said commercial image based on the data 

8 in said information for counting. 

1 32. A broadcast system according to Claim 31 wherein the method for hiding said 

2 information in said commercial image is the data hiding method set forth in any of Claims 

3 1 to 14. 

1 33. A receiving system comprising: 

2 a receiver for receiving a commercial image including hidden information used to 

3 count the number of broadcasts of said commercial image; 

4 a decoder for extracting said information from said commercial image; and 

5 a counter for counting the number of broadcasts of said commercial image based on 

6 said extracted information, wherein: 

7 based on the data in said information for counting, the positions of said dispersively 
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8 hidden information in said commercial image are identified and the information is then 

9 extracted. 

1 34. A receiving system according to Claim 33 wherein the method for extracting said 

2 information from said commercial image is the data extraction method set forth in any of 

3 Claims 15 to 30. 

1 35. A system for transmitting data to a network, comprising: 

2 a storage apparatus in which media data is stored; 

3 a coding apparatus for hiding message data in said media data; and 

4 a server for controlling an encoder so as to hide said message data in said media 

5 data read from said storage means and transmitting the output data from said encoder to 

6 the network, wherein: 

7 said message data is dispersively hidden in said media data based on the contents 

8 of said message data. 

1 36. A system for transmitting data to a network according to Claim 35 wherein said hiding 

2 method is the data hiding method set forth in any of Claims 1 to 1 4. 

1 37. A system for receiving data from a network, comprising: 

2 a receiver for receiving media data with message data hidden therein from the 

3 network; and 

4 a decoder for extracting said message data from said media data; wherein: 

5 based on the contents of said message data, the positions of said dispersively 

6 message data in said media data are identified and the message data is then extracted. 

1 38. A system for receiving data from a network according to Claim 37 wherein said 
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2 extraction method is the data extraction method set forth in any of Claims 1 5 to 30. 

1 39. A data hiding method in which first data is expressed as a first array while a second 

2 data is expressed as a second array and in which the elements of said second array are 

3 dispersively hidden in said first array based on a state value specifying a particular array 

4 element of said first array, comprising the steps of: 

5 (a) determining the j-th (j ;> 0) state value; 

6 (b) determining 0 + 1 )-th state value based on said j-th state value, the array element 

7 of said first array indicated by the j-th state value, and the array elements of said second 

8 array; and 

9 (c) hiding data with respect to the array element of said first array indicated by said 

10 0 + 1)-th state value. 

1 40. A data extraction method in which second data is expressed as a second array while 

2 third data including said second data is expressed as a third array and in which first data 

3 with said third data dispersively hidden therein is expressed as a first array, said second 

4 array being extracted from said first array based on a state value specifying a particular 

5 array element of said first array, comprising the steps of: 

6 (a) determining the j-th (j ;> 1 ) state value; 

7 (b) extracting an array element of said third array from the array element of said first 

8 array indicated by said j-th state value; 

9 (c) determining the (j - 1)-th state value based on the j-th state value and the 

10 extracted array element of said third array; and 

1 1 (d) extracting an array element of said second array based on the array element of 

12 said first array indicated by said 0 - 1 )-th state value and the extracted array element of said 

13 third array. 
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1 41 . A data hiding method for hiding data in media data wherein: 

2 information on the reference position for the said message data and said media data 

3 is hidden in said message data, and wherein 

4 the information on said reference position is hidden in said message data and 

5 enables said reference position of the original message data or a relative position based on 

6 said reference position to be detected even if part of said message data is cut off as partial 

7 message data. 



1 42. A data hiding method for hiding data in media data according to Claim 41 wherein 

2 the information on said reference position is concentric circular arcs drawn around said 

3 reference position. 



1 43. A data hiding method for hiding data in media data comprising the steps of: 

2 (a) identifying one of those blocks in the media data in which the message data is 

3 to be hidden; 

4 (b) determining the characteristic value of said identified block; and 

5 (c) manipulating the characteristic value of said block so as to hide the message data 

6 by referencing a translation rule that associates the contents of the data to be hidden with 

7 the difference between a reference value for the characteristic value and the characteristic 

8 value of said block. 



1 44. A data hiding method for hiding data in media data according to Claim 43 wherein 

2 said reference value is provided by the characteristic value of a different block present in 

3 the media data. 

1 45. A data hiding method for hiding data in media data comprising the steps of: 

2 (a) identifying a pair of blocks in the media data which comprise a first block and a 
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3 second block; 

4 (b) comparing said determined characteristic values and manipulating the 

5 characteristic values of said blocks constituting said pair so as to hide the message data; 

6 (c) comparing the difference between said determined characteristic values, and 

7 based on the translation rule, manipulating the characteristic values of said respective 

8 blocks constituting said pair so as to hide part of the data in the message data; 

9 (d) repeating steps (a) to (c) to hide all the message data. 

1 46. A data hiding method for hiding data in media data comprising the steps of: 

2 (a) identifying a pair of blocks in the media data which comprise a first block and a 

3 second block; and 

4 (b) determining the characteristic values of said respective blocks constituting said 

5 pair; 

6 (c) exchanging the characteristic values of said blocks constituting said pair so as to 

7 hide the message data, based on a translation rule that specifies the difference in the 

8 characteristic value between the blocks. 

1 47. A data hiding method according to Claim 44, 45, or 46 wherein said second block is 

2 adjacent to said first block constituting a pair with the second block. 

1 48. A data hiding method according to Claim 44, 45, or 46 wherein said translation rule 

2 states that one of the bits is hidden if the characteristic value of said first block is larger than 

3 that of the second block, while the other bit is hidden if the characteristic value of said first 

4 block is smaller than that of the second block. 

1 49. A data hiding method according to Claim 44, 45, or 46 further including the step of 

2 prohibiting the execution of said step (c) if the difference in the characteristic value between 
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3 said first block and said second block is larger than a first threshold, thereby preventing the 

4 media data from being degraded. 

1 50. A data hiding method according to Claim 44, 45, or 46 further including the step of 

2 prohibiting the execution of said step (c) if the difference in the characteristic value between 

3 said first block and said second block is smaller than a second threshold, thereby 

4 preventing the degradation of the accuracy in extracting the message data. 

1 51 . A data hiding method according to Claim 44, 45, or 46 wherein said block is a pixel 

2 block, and wherein said characteristic value is the brightness value of said pixel block. 

1 52. A data hiding method according to Claim 43, 45, or 46 wherein said block is a pixel 

2 block, and wherein said characteristic value is the variance value of said pixel block. 

1 53. A data extraction method for extracting message data from media data with the 

2 message data hidden therein comprising the steps of: 

3 (a) identifying one of those blocks in the media data in which the message data is 

4 hidden; 

5 (b) determining the characteristic value of said identified block; and 

6 (c) extracting the message data according to the characteristic value of said block 

7 by referencing a translation rule that associates the contents of the data to be extracted with 

8 the difference between a reference value for the characteristic value and the characteristic 

9 value of said block. 

1 54. A data extraction method according to Claim 53 wherein said reference value is 

2 present in the media data and is the characteristic value of a block different from said block. 
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1 55. A data extraction method for extracting message data from media data with the 

2 message data hidden therein comprising the steps of: 

3 (a) identifying a pair of blocks in the media data in which the message data is hidden; 

4 (b) determining the characteristic values of said blocks constituting said pair; and 

5 (c) extracting the hidden data according to the characteristic value of the said 

6 respective blocks by referencing a translation rule that associates the contents of the data 

7 to be extracted with the difference in the characteristic value between said blocks 

8 constituting said pair. 

1 56. A data extraction method according to Claim 54 or 55 wherein said second block is 

2 adjacent to said first block constituting a pair with the second block. 

1 57. A data extraction method according to Claim 54 or 55 wherein said translation rule 

2 states that one of the bits is extracted if the characteristic value of said first block is larger 

3 than that of the second block, while the other bit is extracted if the characteristic value of 

4 said first block is smaller than that of the second block. 

1 58. A data extraction method according to Claim 54 or 55 wherein said translation rule 

2 determines that no media data is hidden in the pair if the difference in the characteristic 

3 value between said first block and said second block constituting the pair is smaller than a 

4 specified threshold. 

1 59. A data extraction method according to Claim 53 or 55 wherein said block is a pixel 

2 block, and wherein said characteristic value is the brightness value of said pixel block. 

1 60. A data extraction method according to Claim 53 and 55 wherein said block is a pixel 

2 block, and wherein said characteristic value is the variance value of said pixel block. 
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1 61 . A data extraction system for extracting message data from media data with the 

2 message data hidden therein, comprising: 

3 a converter for converting the message data comprising analog signals and including 

4 the hidden message data, into digital signals and for outputting the digital signals 

5 an identifier for identifying one of those blocks in the media data in which the 

6 message data is hidden; 

7 a characteristic value calculator for determining the characteristic value of said block 

8 identified by said identifier; 

9 a storage for storing a translation rule that associates the contents of the data to be 

I o extracted with the difference between a reference value for the characteristic value and the 

I I characteristic value of said block; and 

1 2 an extractor for extracting the message data according to the characteristic value of 

13 said block by referencing said translation rule. 

1 62. A data extraction system according to Claim 61 wherein said reference value is 

2 present in the media data and is the characteristic value of a block different from said block. 

1 63. A data extraction system according to Claim 61 wherein said translation rule states 

2 that one of the bits is extracted if the characteristic value of said first block is larger than that 

3 of the second block, while the other bit is extracted if the characteristic value of said first 

4 block is smaller than that of the second block. 

1 64. A semiconductor integrated circuit for extracting message data from media data with 

2 the message data hidden therein, comprising: 

3 a device for determining the characteristic value of one of the blocks in the media 

4 data comprising input signals which is identified as the one in which the message data is 
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5 hidden; and 

6 an extractor for extracting the hidden message data according to the characteristic 

7 value of said block by referencing a translation rule that associates the contents of the data 

8 to be extracted with the difference between a reference value for the characteristic value 

9 and the characteristic value of said block. 

1 65. A semiconductor integrated circuit according to Claim 64 wherein said reference 

2 value is present in the media data and is the characteristic value of a block different from 

3 said block. 

1 66. A semiconductor integrated circuit according to Claim 65 wherein said translation rule 

2 states that one of the bits is extracted if the characteristic value of said first block is larger 

3 than that of the second block, while the other bit is extracted if the characteristic value of 

4 said first block is smaller than that of the second block. 
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